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PROCESS FOR MAKING AND PROGRAMMING AND OPERATING A DUAL-BIT MULTI-LEVEL 

BALLISTIC MONOS MEMORY 

BACKGROUND OF THE INVENTION 

1) Field of Invention 

The invention relates to methods of forming high-density Metal/polysilicon 
Oxide Nitride Oxide Silicon (MONOS) memory arrays and the resulting high density MONOS 
memory arrays. 

1 ) Description of Prior Art 

Floating gate and MONOS are two types of non-volatile memories. In 
conventional floating gate structures, electrons are stored onto a floating gate, by either F-N 
tunneling or source side injection. Conventional MONOS devices store electrons usually by direct 
tunneling in the Oxide-Nitride-Oxide (ONO) layer which is below the memory word gate. Electrons 
are trapped in the Nitride layer of the ONO composite. The MONOS transistor requires one less 
polysilicon layer than the floating gate device, which simplifies the process and could result in a 
denser array. 

MONOS structures are conventionally planar devices in which an ONO 
composite layer is deposited beneath the word gate. The thickness of the bottom oxide of the 
ONO layer is required be less than 3.6nm, in order to utilize direct tunneling for program 
operations. However in 1998, a MONOS structure with a bottom oxide thickness of 5.0nm, and 
side wall polysilicon gates and source side injection program was first reported by Kuo-Tung 
Chang et al, in, U A New SONOS Memory Using Source Side Injection for Programming", IEEE 
Electron Letters . Vol.19, No. 7, July 1998. In this structure, as shown in Fig. 1, a side wall spacer 
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20 is formed on one side of the word gate by a typical side wall process, and the ONO composite 
22 is underneath the side wall gate, instead of under the word gate as for conventional MONOS 
memory cells. The channel under the SONOS side wall control gate is larger than 100nm, so the 
program mechanism is source side injection, which is faster and requires lower voltages than 
electron tunneling, despite the thicker bottom oxide. During source side injection, a channel 
potential is formed at the gap between the side wall gate and the select/word gate. Channel 
electrons 30 are accelerated in this gap region and become hot enough to inject into the ONO 
layer. Thus Kuo-Tung Chang's SONOS memory is able to achieve better program performance 
than previous direct tunneling MONOS cells. 

While the SONOS memory cell is unique among MONOS memories for its 
split gate structure and source side injection program, its structure and principles of program are 
similar to those for a conventional split gate floating gate device. Both cell types have a word gate 
and side wall spacer gate in series. The most significant differences lie in the manner of side wall 
gates utilization and electron storage regions. In the split gate floating gate cell, the side wall 
spacer is a floating gate onto which electrons are stored. The floating gate voltage is determined 
by capacitance coupling between the word gate, diffusion, and floating gate. For the SONOS cell, 
electrons are stored in the nitride region beneath the side wall spacer, which is called the control 
gate. The nitride region voltage is directly controlled by the voltage of the above side wall gate. 

A floating gate memory cell having faster program and higher density was 
introduced in co-pending U.S. Patent Application Serial Number 09/313,302 to the same inventors, 
filed on May 17, 1999. Fig. 3A is an array schematic and Fig. 3B is a layout cross-section of this 
fast program, dual-bit, and high density memory cell. In this memory structure, high density is 
achieved by pairing two side wall floating gates to one word gate (for example, floating gates 312 
and 313 and word gate 341), and sharing interchangeable source-drain diffusions (321 and 322) 
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between cells. Thus a single memory cell has two sites of electron storage. Additional polysilicon 
lines "control gates" run in parallel to the diffusions and orthogonal to the word gates. The control 
gates (331 and 332) couple to the floating gates and provide another dimension of control in order 
to individually select a floating gate from its pair. This memory is further characterized by fast 
programming due to ballistic injection. Using the same device structure, if the side wall gate 
channel is reduced to less than 40nm with proper impurity profiles, the injection mechanism 
changes from source side injection to a new and much more efficient injection mechanism called 
balllistic injection. The ballistic injection mechanism has been proven by S. Ogura in "Step Split 
Gate Cell with Ballistic Direction Injection for EEPROM/Flash", IEDM 1998, pp.987. In Figure 2A, 
results between ballistic injection (line 25) and conventional source side injection (line 27) are 
compared for a floating gate memory cell. Although the structures are very similar, when the 
control gate is 100nm, the injection mechanism is source side injection. However, as illustrated in 
Fig. 2B, when the channel is reduced to 40nm to satisfy the short channel length requirement for 
ballistic injection (line 35), program speed increases by three orders of magnitude under the same 
bias conditions, or at half of the floating gate voltage requirement for source side injection (line 37). 

In contrast, the side wall channel length of Kuo Tung Chang's SONOS 
memory structure is 200nm, so the program mechanism is source side injection. Thus there is a 
significant dependence between the short channel length and the injection mechanism. 

SUMMARY OF THE INVENTION 

In this invention, a fast low voltage ballistic program, ultra-short channel, 
ultra-high density, dual-bit multi-level flash memory is achieved with a two or three polysilicon split 
gate side wall process. The structure and operation of this invention is enabled by a twin MONOS 
. cell structure having an ultra-short control gate channel of less than 40nm, with ballistic injection 
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(S. Ogura) which provides high electron injection efficiency and very fast program at low program 
voltages of 3~5V. The cell structure is realized by (i) placing side wall control gates over a 
composite of Oxide-Nitride-Oxide (ONO) on both sides of the word gate, and (ii) forming the 
control gates and bit diffusion by self-alignment and sharing the control gates and bit diffusions 
between memory cells for high density. Key elements used in this process are: 

(i) Disposable side wall process to fabricate the ultra short channel and the side wall control 
gate with or without a step structure. 

(ii) Self-aligned definition of the control gate over the storage nitride and the bit line diffusion, 
which also runs in the same direction as the control gate. 

The features of fast program, low voltage, ultra-high density, dual-bit, multi-level MONOS NVRAM 
of the present invention include: 

1. Electron memory storage in nitride regions within an ONO layer underlying the control gates. 

2. High density dual-bit cell in which there are two nitride memory storage elements per cell 

3. High density dual-bit cell can store multi-levels in each of the nitride regions 

4. Low current program controlled by the word gate and control gate 

5. Fast, low voltage program by ballistic injection utilizing the controllable ultra-short channel 
MONOS 

6. Side wall control poly gates to program and read multi-levels while masking out memory 
storage state effects of the unselected adjacent nitride regions and memory cells. 

The ballistic MONOS memory cell is arranged in the following array: each 
memory cell contains two nitride regions for one word gate, and V4 a source diffusion and 1 /4 a bit 
diffusion. Control gates can be defined separately or shared together over the same diffusion. 
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Diffusions are shared between cells and run in parallel to the side wall control gates, and 
perpendicular to the word line. 

A summary of the operating conditions for multi-level storage is given in 
Figure 3B. During read, the following conditions need to be met: the voltage of the unselected 
control gate within a selected memory cell must be greater than the threshold voltage of the 
control + source voltage. The word select gate in the control gate pair is raised to the threshold 
voltage of the word gate + an override delta of around 0.5V + source voltage (Vt-wl + Voverdrive 
+Vs). Un-selected MONOS cells will be disabled by reducing the associated control gates to 0V. 
Program conditions are: Word line voltage is greater than threshold + an overdrive voltage delta 
for low current program. Both control gates in the selected pair are greater than Vt-high (the 
highest threshold voltage within the range of multi-level thresholds) + override delta. Adjacent 
memory cells sharing the same word line voltage are disabled by adjusting the control gates only. 

BRIEF DESCRIPTION OF THE DRAWINGS 

In the accompanying drawings forming a material part of this description, 

there is shown: 

Figure 1 is a device structure of prior art SONOS (Silicon Oxide Nitride 

Oxide Silicon). 

Figure 2A graphically represents empirical results for a split gate floating 
gate transistor, demonstrating that for a channel length of 100nm, source side injection requires 
high voltage operation. 
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Figure 2B graphically represents empirical results for a split gate floating 
gate transistor showing that for a channel length of 40nm, ballistic injection operates at much lower 
voltages and/or much faster program speed. 

Figure 3A is an array schematic of the prior art double side wall dual-bit split 
floating gate cell with ultra short ballistic channel. 

Figure 3B is a layout cross-section of the prior art double side wall dual-bit 
split floating gate cell with ultra short ballistic channel. 

Figs. 4A through 4F are cross sectional representations of a first preferred 
embodiment of the process of the present invention. 

Fig. 4G is a bird eye's view of the completed memory cell of the present 

invention. 

Figs. 5B, 5C, and 5F are cross sectional representations of a second 
preferred embodiment of the process of the present invention. 

Figs. 6A through 6F are cross sectional representations of a third preferred 
embodiment of the process of the present invention. 

Figure 7A is an array schematic of the present invention. 

Figure 7B is a cross-sectional representation of the present invention. 
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Figure 7C gives the required voltage conditions during read for the present 

invention. 

Figures 8A, 8B, and 8C are graphical representations of voltage sensing 
curves for the present invention during read. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Presented in this invention is a fabrication method for a ballistic twin 
MONOS memory cell with two nitride memory elements and two shared control gates. The 
method can be applied to a device with a flat channel and/or a device having a step channel under 
the nitride layer in the MONOS cell. 

The procedures for formation of shallow trench isolation, p-well, and n-well 
are the same as for conventional CMOS processing and will not be shown. The polysilicon word 
gate is also defined by conventional CMOS processing as shown in Fig. 4A. In order to define the 
word gate, the memory gate silicon oxide 221 is formed to a thickness of between about 5 and 10 
nanometers. Then the polysilicon 245 with a thickness of between about 150 and 250 nm for the 
gate material is deposited by chemical vapor deposition (CVD). A nitride layer 232 is deposited by 
CVD to a thickness of between about 50 and 100nm to be used later as an etch stop layer for 
chemical mechanical polishing (CMP). Normal CMOS processing defines the memory word gates; 
i.e., photoresist and masking processes with exposure, development, and vertical etching of the 
nitride 232 and polysilicon 245 by reactive ion etching (RIE) are performed. Extra boron 202 is ion 
implanted at low energy (less than about 10KeV energy) with an ion dosage of between 3E12 to 
3E13 ions per cm 2 , in order to adjust VT under the floating gate. After removing the photoresist 
which was used to define the word gate, the word gate is obtained as shown in Fig 4A. 
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A thin silicon oxide layer 234 of between about 5 and 10 nm can be 
thermally grown on the side wall polysilicon, or Si0 2 and/or SiN film can be deposited by uniform 
CVD, as shown in Fig 4B. Then the disposable sidewall process, which defines a controllably 
short channel and provides fast programming by high electron injection efficiency, is performed. A 
thin polysilicon layer typically having a thickness of between about 30 to 50 nm is deposited. Then 
a vertical or anisotropic polysilicon etch is performed, which forms the disposable sidewall spacer 
242 on both sides of the word gate 245, as shown in Fig.4B. Implantation with an N dopant 203 
such as arsenic is performed with an ion dosage of between 3E13 and 4E13/cm 2 at 10 to 15KeV. 
Thus, the thickness of the polysilicon layer determines the effective channel length under the 
control gate. 

Referring now to Fig. 4C, the disposable side wall spacer 242 is gently 
removed by a dry chemical anisotropic etch. A typical etch ambient for this step is HBr/CI 2 /0 2 . The 
bottom silicon oxide 221 is then gently etched out by buffered (with for example water of 
ammonium hydroxide) hydrofluoric acid (BHF), Vapor HF, or a reactive ion etch such as CF 2 /0 2 . 
A composite layer of oxide-nitride-oxide 230 is formed. Layer 230 is shown without the three layers 
for simplicity. The bottom oxide is thermally grown and the thickness is between 3.6 and 5 nm, 
which is slightly thicker than the limit of direct tunneling (3.6nm), the silicon nitride layer deposited 
by chemical vapor deposition is about 2 to 5 nm, and the top oxide is deposited by CVD deposition 
and is between about 4 and 8 nm. Thermal oxidation may be added to improve the top oxide 
quality. Also short nitridation in an N 2 0 environment can be added to improve the bottom oxide 
reliability prior to the deposition of the nitride layer. 

Now, an insitu phosphorus-doped thin polysilicon layer between about 30 
and 50 nm and tungsten silicide between 60 and 100 nm is deposited by CVD. The composite 
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layer of polysilicon and tungsten silicide becomes the control sidewall spacer gate. A vertical, 
anisotropic reactive etch is performed to form the sidewall control gate 240, as shown in Fig. 4C. 
The composite oxide-nitride-oxide layer is also etched through, leaving this ONO layer 230 only 
underlying the sidewall control gates. 

A thin CVD of silicon oxide or nitride 233 with a thickness of about 10nm is 
deposited. Phosphorus and /or Arsenic for n+ junction 204 is implanted subsequently, at a dosage 
of between 3E14 to 5E15 ions per cm 2 , as shown in Fig.4C. The total thickness is between 90 to 
150 nm, which is equal to the summation of effective control gate channel length and lateral out 
diffusion of the n+ junction. 

As an option, the sidewall spacer gate 240 can be simply an insitu 
phosphorus or As doped polysilicon layer instead of the composite layer of polysilicon and 
tungsten silicide. After the formation of the n+ junction and the deposition of a thin CVD of silicon 
oxide or nitride 233 with a thickness of about 10nm, the vertical reactive ion etch is performed to 
form sidewall oxide spacer 233 on the gate 240 when the control gate requires low resistivity and 
silicidation, as shown in Fig. 4D. In typical silicidation, about 10 nm Co orTi is deposited by 
plasma sputtering and a Rapid Thermal Anneal at about 650 °C is performed. The formation of 
silicide layer 241 on the top part of gate 240 and diffusion 204 are shown in Fig 4D. Although 
silicidation 241 is shown in Fig. 4D, it is not required. It is an option to reduce the RC time 
constant of the control gate lines or diffusion lines in order to improve performance in all modes of 
operation, read, program, and erase. 

An oxide and/or nitride layer 235 for contamination barrier is deposited by 
CVD. Then a layer of CVD silicon oxide or BSG 247 is deposited to fill the gap. The gap fill 
material is polished by CMP up to the nitride layer 232. 
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As an option, the gap fill material 247 can be a conductive material like 
polysilicon or W, which can be used for reducing the RC time constant of the sidewall gate or bit 
diffusion depending on the need. When the conductive layer is polished by CMP up to the nitride 
layer 232, the conductive layer is several hundred nanometers (50nm) recessed by vertical 
reactive ion etch. Then a CVD Si0 2 layer (about 50nm) is deposited and CMP is performed as 
illustrated by 236 as shown in Fig 4E. 

The nitride layer 232 in Fig. 4E is selectively etched by H 3 P0 4 or etched by 
a chemical dry etch. The polysilicon layer thickness of between 150 and 200 nm is deposited by 
CVD. This polysilicon layer 248 and the underlying polysilicon word gate 245 are defined by 
normal photoresist and RIE processes. The structure at this point is as shown in Fig. 4F. 

The polysilicon layer 248 acts as a word line wire by connecting adjacent 
word line gates. The final memory cell is completed at this point. This word polysilicon layer can 
be silicided with Ti or Co to reduce the sheet resistance. A typical bird's-eye view of the memory 
cell is shown in Fig 4G. The shallow trench isolation region is shown by area 209 in Fig. 4G. 

The preceding processes describe fabrication of planar channel floating 
gates with very short channel (30 to 50nm). By modifying and adding a few process steps, a step 
split structure with more efficient ballistic injection can be fabricated using the same process 
integration scheme as the planar structure. This second embodiment of the present invention will 
be described with reference to Figs. 5B, 5C, and 5F. 

After forming disposable sidewall spacer 242 by etching vertically the doped 
polysilicon, the silicon oxide layer 221 is vertically etched which corresponds to Fig 4B. In order to 
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form a step split memory cell, the deviation starts at this point by continuing to etch into the silicon 
substrate by approximately 20 to 50nm. Then the bottom of the step is lightly implanted with 
Arsenic to form N-region 203 using the poly sidewall as a mask as shown in Fig 5B, where the 
dosage is about 3E13 to 4E13/cm 2 at 10 to 15KeV. Next, the N+ doped polysilicon disposable 
spacer is selectively removed by a wet etch (HNCVHF/Acitic acid, or H 3 P0 4 or NH 4 OH) or a dry 
plasma etch to the lightly doped bulk N- region. The bulk etching during this disposable spacer 
etch can be included as part of step etching. After gently etching off the left over gate oxide 221 
under the disposable polysilicon spacer, the silicon surface is cleaned. The total step into silicon 
should be about 20 to 50 nm. If the step corner is sharp, corner rounding by rapid thermal anneal 
(RTA) at between about 1000 to 1 100° C for about 60 seconds can be added as an option or a 
hydrogen anneal at 900°C and at a pressure of 200 to 300 mtorr can be performed. After these 
modifications and additions, the fabrication sequence returns to the procedures described 
previously. 

Referring to Fig. 5C, a composite layer of oxide-nitride-oxide is formed. 
Layer 230 is shown without the three layers for simplicity. The bottom oxide is thermally grown 
and the thickness is between 3.6 and 5 nm, which is slightly thicker than the limit of direct 
tunneling (3.6nm), the silicon nitride layer deposited by chemical vapor deposition (CVD) is about 
2 to 5 nm, and the top oxide is deposited by CVD deposition and is between about 4 and 8nm. 
Thermal oxidation may be added to improve the top oxide quality. Also, short nitridation in an N 2 0 
environment can be added to improve the bottom oxide reliability prior to the deposition of the 
nitride layer. 

Then an insitu phosphorous-doped polysilicon layer, which becomes the 
control gate, is deposited having a thickness of between 90 to 180 nm, and a vertical or 
anisotropic polysilicon etch is performed to form the sidewall gate 240, a shown in Fig 5C. By 
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following the process steps given for the planar split device, the step-split device can be fabricated 
as shown in Fig 5F. This sidewall polysilicon gate can be silicided or replaced by refractory silicide 
as utilized in the first embodiment of the flat channel MONOS twin celL 

In the above process steps for both the planar and step devices, the 
disposable side wall spacer 242 can be plasma nitride or oxynitride or Boron Phosphorus Silicate 
Glass (BPSG) instead of polysilicon, since the etching rate of that material to the thermal silicon 
oxide can be very high (for example at least 10-100 times) in H 3 P0 4 acid or diluted HF. 

A third embodiment of the present invention will be described with reference 
to Figs. 6A-6D and 6F. The third embodiment of the present invention will be a simplified process 
of the first embodiment of the planar twin MONOS memory cell with a slight program speed 
penalty because controllability will be lost due to the usage of a single large spacer instead of two 
side wall spacers. Deviation from the normal CMOS process starts prior to deposition of word 
gate polysilicon 245. A composite layer of oxide-nitride-oxide (ONO), 230 in Fig 6A, is formed. 
Layer 230 is again shown without the three layers for simplicity. The bottom silicon oxide layer is 
preferred to be grown thermally with a thickness of between about 3.6nm to 5nm, the silicon nitride 
layer deposited by CVD deposition is about 2 to 5 nm and the top oxide layer is deposited by CVD 
deposition and about 5 to 8 nm thick. The top oxide CVD layer is slightly thicker compared to the 
first and second process embodiments, for subsequent polysilicon and disposable sidewall spacer 
etch stop. Then the polysilicon 245 for gate material is deposited by CVD and followed by CVD 
silicon nitride 232 deposition thickness of between about 50 to 100 nm. 

Then a photoresist layer is formed and a masking process with exposure 
and development to define memory gates 245 are performed. The polysilicon layer is now etched 
vertically by reactive ion etching (RIE), using the under layer top silicon oxide in the composite 
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layer 230 as a etch stop. Then extra boron 202 is ion implanted at low energy (less than 10 keV 
power and ion dosage of between about 5E 12 to 2 E13 ions per cm 2 , also shallow As is implanted 
at the same time at about 5E12 to 1.5E13 at the same KeV range as is the boron, as shown in Fig. 
6A. Even though the channel threshold is very low due to As compensation, there is plenty of 
impurity to create a channel potential drop in the short channel region. (See co-pending U.S. 

Patent Application Serial Number 09/ to the same inventors, filed on , 1999. (Halo 

99-003) This step is shown in Fig. 6A. 

A thin silicon oxide layer 234 of about 5 nm is thermally grown on the side 
of polysilicon or CVD uniformly deposited. Then a disposable polysilicon layer typically having a 
thickness of between about 90 to 150nm is deposited. Then the vertical or anisotropic polysilicon 
etch is performed, which forms the disposable sidewall spacer 243 in Fig 6B. This is a thicker 
spacer than in the first and second embodiments. Then As ions are implanted at dosage of 
between 1 E15 to 5E15 cm 2 and at the energy range of 20 to 50 KeV through the composite layer 
of oxide-nitride in order to form an N+ junction 204. By adjusting the lateral out diffusion with 
annealing temperature and time (between 850 to 900 °C and 5 to 20 min), the channel length 
defined from the edge of the word gate to the N+ junction edge is designed to be about 30 to 50 
nm (3 to 4 times the electron mean free length) for ballistic high injection efficiency at low voltage. 

Afterwards, the disposable side wall spacer 243 is gently removed by a dry 
chemical, isotropic etch. A typical etch ambient for this step is HBr/CL 2 /0 2 . The exposed silicon 
oxide over nitride is gently etched out by buffered hydrofluoric acid. A fresh silicon oxide 244 
replacing the top oxide in the composite ONO 230, shown in Fig. 6C, of about 4 to 6 nm is 
deposited by chemical vapor deposition. Thermal oxidation is added after the top layer is 
deposited to improve the top oxide quality. 
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As an option, prior to removal of the disposable sidewall spacer 243, the 
exposed top two layers of oxide-nitride are etched by RIE. Then the fresh oxide of about 4 to 6 nm 
is deposited by chemical vapor deposition and followed by thermal oxidation for the top oxide 
improvement. During this oxidation process of about 850 to 900 °C and 10 min in wet 0 2 
atmosphere an extra oxide layer of about 20nm is formed on the nitride cut area over the n+ 
junction as shown by 244 in Fig. 6D. This thick oxide reduces the coupling capacitance between 
control gate 240 and bit diffusion 204. 

A layer of polysilicon approximately 300 nm, which is slightly thicker than the 
summation of word polysilicon 245 and the top nitride 232 height, is deposited and CMP is 
performed using the nitride layer as the etch stop layer. Then the filled polysilicon layer 240 is 
recessed about 50nm by a vertical, anisotropic reactive ion etch. Then thin Ti or Co of about 10 
nm is deposited and silicidation is performed. The silicide layer 241 is to reduce the control gate 
resistance. A CVD Si0 2 deposition and CMP is performed again, as illustrated by 236. The cross 
section of the device at this point is shown in Fig. 6C and in Fig. 6D. 

Then the nitride layer 232 is selectively etched by H 3 P0 4 or etched by a 
chemical dry etch. The polysilicon layer 248 having a thickness of between 150 and 200 nm is 
deposited by CVD. This polysilicon layer and underlying word gate polysilicon 245 are defined by 
normal photoresist and RIE processes. The structure at this point is as shown in Fig 6F. 

The polysilicon layer 248 acts as a word line wire by connecting adjacent 
word line gates. The final memory cell is completed at this point. This word polysilicon layer can be 
silicided with Ti or Co to reduce the sheet resistance. A typical bird's-eye view of the memory cell 
is shown in Fig. 4G. The shallow trench isolation region is provided by the area 209. It is 



14 



HALO-99-006 

understood that these critical dimensions will scale with the technology as the critical dimension is 
reduced. 

In the embodiments described above, two approaches have been combined 
to improve memory density in this invention. In the first approach, density is more than doubled by 
sharing as many cell elements as possible. A single word select gate is shared between two 
nitride charge storage regions, and source lines/bit lines as well as control gate lines are shared 
between adjacent cells. In the second approach, multi-level thresholds are stored in the nitride 
regions under the control gates, and specific voltage and control conditions have been developed 
in order to make multi-level sensing and program possible for the high density array, with good 
margins between each of the threshold levels. 

OPERATING METHOD FOR MULTILEVEL STORAGE 

The procedures described below can be applied to multi-level storage of two 
bits or greater, as well as single-bit/two level storage applications in which Vt-hi and Vt-low are the 
highest and lowest threshold voltages, respectively, to be stored in the nitride region under the 
control gate. The dual bit nature of the memory cell comes from the association of two nitride 
regions paired to a single word gate and the interchangeability of source and drain regions 
between cells. This cell structure can be obtained by a side wall deposition process, and 
fabrication and operation concepts can be applied to both a step split ballistic transistor and/or a 
planar split gate ballistic transistor. The step split and the planar ballistic transistors have low 
programming voltages, fast program times, and thin oxides. 

A cross-section of the array for a planar split gate ballistic transistor 
application is shown in Figure 7B. All word gates 340, 341, and 342 are formed in first level 
polysilicon and connected together to form a word line 350. ONO is formed underneath the 
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sidewalls that are deposited in pairs on either side of the word gates 340, 341, and 342. The 
nitride within the ONO layer which is under each sidewall is the actual region for electron memory 
storage. These nitride regions are 310, 311, 312, 313, 314,315 in Figs. 7B and 7C. In order to 
simplify peripheral decode circuitry, two side wall control gates sharing the same diffusion will be 
connected together to form a single control gate 330, 331, 332, 333, according to process 
embodiment 3 and embodiments 1 and 2 in which the gap-filling material 247 is a conductor. In the 
cases of process embodiments 1 and 2 in which two side wall gates sharing a diffusion are 
isolated from each other (where the gap-filling material is an insulator), it is feasible to electrically 
connect these two gates together with a wire outside of the memory array. Although it is also 
possible to operate the memory array with individual sidewall gates as control gates, peripheral 
logic will become more cumbersome, which does not meet the interests of high density memory. 

Nitride regions 31 1 and 31 2 share control gate 331 , and nitride regions 31 3 
and 314 share control gate 332. A memory cell 301 can be described as having a source diffusion 
321 and bit diffusion 322, with three gates in series between the source diffusion and the bit 
diffusion, a control gate 331 with underlying nitride region 312, a word gate 341, and another 
control gate 332 with underlying nitride region 313. The word gate 341 is a simple logical ON/OFF 
switch, and the control gates allow individual expression of a selected nitride region's voltage state 
during read. Two nitride charge regions which share the same word gate will be hereinafter 
referred to as a "nitride charge region pair". Within a single memory cell 301, one nitride charge 
region 313 is selected within a nitride charge region pair for read access or program operations. 
The "selected nitride charge region" 313 will refer to the selected nitride region of a selected nitride 
pair. The "unselected nitride charge region" 312 will refer to the unselected nitride charge region of 
a selected nitride charge region pair. "Near adjacent nitride charge regions" 31 1 and 314 will refer 
to the nitride charge regions of the nitride charge pairs in the adjacent unselected memory cells 
which are closest to the selected memory cell 301 . "Far unselected adjacent nitride charge 
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regions" 310 and 315 will refer to the nitride charge regions opposite the near unselected adjacent 
nitride charge regions within the same unselected adjacent memory cell nitride charge region 
pairs. The "source" diffusion 321 of a selected memory cell will be the farther of the two memory 
cell diffusions from the selected nitride charge region and the junction closest to the selected 
nitride charge region will be referred to as the "bit" diffusion 322. 



In this invention, control gate voltages are manipulated to isolate the 
behavior of an individual nitride charge region from a pair of nitride charge regions. There are 
three control gate voltage states: "over-ride", "express", and "suppress". A description of the 
control gate voltage states follows, in which the word line voltage is assumed to be 2.0V, the "bit" 
diffusion voltage is 0V, and the "source" diffusion voltage is assumed to be 1.2V. It should be 
understood that the voltages given are examples for only one of many possible applications, 
depending on the features of the process technology, and are not to be limiting in any way. In the 
over-ride state, the V(CG) is raised to a high voltage (~5V) forcing the channel under the control 
gate to conduct regardless of the charge stored in the nitride regions. In the express state, the 
control gate voltage is raised to about Vt-hi (2.0V), and the channel under the control gate will 
conduct, depending on the programmed state of the nitride regions. In suppress-mode, the control 
gate is set to 0V to suppress conduction of the underlying channels. 

Table 1 gives the voltages during read of selected nitride region 313. 



Voltages for Read of Selected FG=313 
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*lf threshold voltage is slightly negative, it is possible to suppress the nitride threshold region with 
a slightly negative control gate voltage (about -0.7V) 

During read operation of nitride region 313, shown in Fig. 3C, the source 
line 321 can be set to some intermediate voltage (-1.2V) and the bit line 322 may be precharged 
to 0V. In addition, the following conditions must be met in order to read a selected nitride charge 
region: 1) the word select gate voltage must be raised from OV to a voltage (2.5V) which is some 
delta greater than the sum of the threshold voltage of the word select gate (Vt-wl=0.5V) and the 
source voltage (1 .2V), and 2) the voltage of the control gate above the selected nitride charge 
region must be near Vt-hi ("express"). The voltage of the control gate above the unselected nitride 
charge regions must be greater than the source voltage plus Vt-hi ("over-ride"). The control gates 
above the unselected adjacent nitride charge regions must be zero ("suppress"). The voltage of 
the bit diffusion 322 can be monitored by a sense amplifier and compared to a switch-able 
reference voltage, or several sense amplifiers each with a different reference voltage, to determine 
the binary value that corresponds to nitride charge region 313's threshold voltage, in a serial or 
parallel read manner, respectively. Thus, by over-riding the unselected nitride region within the 
selected memory cell, and then suppressing the adjacent cell unselected nitride regions, the 
threshold state of an individual selected nitride region can be determined. 

For ballistic channel hot electron injection, electrons are energized by a high 
source-drain potential, to inject through the oxide and onto the nitride. The magnitude of the 
programmed threshold voltage can be controlled by the source-drain potential and the program 
duration. Table 2 describes the voltages to program multiple threshold voltages to a selected 
nitride region 313. These voltages are for example only, to facilitate description of the program 
method, and are not limiting in any way. In Table 2A, the control gates 331 , 332 associated with 
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the selected memory cell 301 are raised to a high voltage (5V) to over-ride the nitride charge 
regions 312 and 313. 



Bit Diffusion Method Program of Selected Nitride CI 
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Table 2A 

Program of the desired threshold level is determined by the bit diffusion 322: 
The bit diffusion 322 is fixed to 5V, 4.5V, or 4.0V in order to program threshold voltages of 2.0V, 
1.6V and 1.2V, respectively. When the word line 350 is raised above the word gate's 341 
threshold, high energy electrons will be released into the channel, and injection begins. To inhibit 
program in the adjacent memory cells, the far adjacent control gates are set to 0V, so there will be 
no electrons in the channels of the adjacent memory cells. Thus, multi-level threshold program can 
be achieved by bit diffusion voltage control for this high density memory array. It is also possible to 
program multiple thresholds by varying the word line voltage, for example 4.5V, 5V and 5.5V, to 
program 1.2V, 1.6V and 2.0V, respectively. 

Another possible method of program is to vary the control gate voltage in 
order to obtain different threshold levels. If multi-levels are to be obtained by control gate voltage, 
the unselected control gate 331 within the selected memory cell 301 will be set high to 5V in order 
to over-ride nitride region 312. The control gate 332 over the selected nitride region 313 will be 
varied to 4.5V, 5V and 5.5V, to obtain threshold voltages of 1 .2V, 1 .6V and 2.0V, respectively. 
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A fourth program method variation to the voltage conditions described for 
multi-level program is given in Table 2B, in which the selected control gate voltage matches the bit 
voltage for Vd=5V, 4.5V, and 4.0V and Vcg=5V, 4.5V, and 4.0V, respectively. 



Control Gate-Bit Method Program of Selected Nitride Charge Region 313 
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Table 2B 

Because the program current is low, and by programming schemes 
described above, it is possible to program several cells on the same word line in a parallel 
operation. Furthermore, depending on the peripheral decoding circuitry, multiple thresholds may 
also be programmed simultaneously, if the program methods of bit diffusion or control gate control 
are used. It should be noted however, that selected memory cells can have no fewer than two 
memory cells between each other, in order to obtain properly isolated behavior. Also, in order to 
obtain the tight Vt margins which are necessary for multi-level operation, the threshold voltage 
should be periodically checked during program, by a program verify cycle which is similar to a read 
operation. Program verify for the ballistic short channel sidewall MONOS in this invention is 
simpler than conventional floating gate and MONOS memories because program voltages are so 
low and very similar to read voltage conditions. 

Removal of electrons from the nitride region during erase can be done by 
hot hole injection from the nitride region to the diffusion, or by F-N tunneling from the nitride region 
to the control gate. In the hot hole injection method, the substrate is grounded, diffusions are set to 
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5V and negative 5V is applied to the control gate. For F-N tunneling, a negative 3.5V is applied to 
both the substrate and diffusions and positive 5V is applied to the control gates. A block of nitride 
regions must be erased at once. A single nitride region cannot be erased. 

PREFERRED EMBODIMENT FOR READ 

Read operation for a two bit multi-level storage in each of the nitride regions 
will be described, based on simulations for a 0.25u process. Figure 8A illustrates the memory cell 
and voltage conditions for a read of nitride charge region 313. The threshold voltages for the four 
levels of storage are 0.8V, 1 .2V 1 .6V and 2.0 for the "11", "10", and "01" and "00" states, 
respectively. This is shown in Figure 8B. The threshold voltage for the word select gate is 0.5V. 
During read, the source voltage is fixed to 1.2V. The control gate above the unselected nitride 
charge region is set to 5V, which overrides all possible threshold states, and the control gate 
above the selected nitride charge region is set to 2.0V, which is the highest threshold voltage of all 
the possible threshold states. All other control gates are set to zero, and the bit junction is 
precharged to zero. The word line is then raised from OVto 1.0V, and the bit junction is monitored. 

Sensing the bit junction yields the curves shown in Figure 8C. Bit line 
voltage sensing curves 71, 73, 75, and 77 during read of nitride charge region 313 are shown for 
different thresholds 0.8V, 1.2V, 1.6V, and 2.0V, respectively. It can be seen from the voltage 
curves, that the voltage difference between each of the states is approximately 300mV, which is 
well within sensing margins. Simulation has also confirmed that the sate of the unselected cell has 
very little impact on the bit junction voltage curve in Figure 8C. 

The present invention provides a method for forming a double side wall 
control gate having an ONO nitride charge storage region underneath with an ultra short channel. 
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The enhancement mode channel is around 35nm, and is defined by the side wall spacer. The 
isolation between the word gates is formed by a self-aligned Si0 2 filling technique. The polysilicon 
control gate is formed by a self-aligned technique using chemical mechanical polishing. The 
process of the invention include two embodiments: a planar short channel structure with ballistic 
injection and a step split short channel structure with ballistic injection. A third embodiment 
provides isolation of adjacent word gates after control gate definition. 

While the invention has been particularly shown and described with 
reference to the preferred embodiments thereof, it will be understood by those skilled in the art that 
various changes in form and details may be made without departing from the spirit and scope of 
the invention. 

What is claimed is: 
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